Model Selection

Stable Training

# Stable Training

Ppo LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.

Td3 MountainCarContinuous V0

A TD3 reinforcement learning agent trained based on the stable-baselines3 library, specifically designed for the MountainCarContinuous-v0 environment.

This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.

Td3 HalfCheetah V3

This is a TD3 reinforcement learning agent trained using the stable-baselines3 library, specifically designed for the HalfCheetah-v3 environment, achieving an average reward of 9709.01.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase